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Abstract 

A central theme in current work in quantum information and quantum 
foundations is to see quantum mechanics as occupying one point in a 
space of possible theories, and to use this perspective to understand the 
special features and properties which single it out, and the possibilities 
for alternative theories. Two formalisms which have been used in this 
context are operational theories, and categorical quantum mechanics. The 
aim of the present paper is to establish strong connections between these 
two formalisms. We show how models of categorical quantum mechanics 
have representations as operational theories. We then show how non- 
locality can be formulated at this level of generality, and study a number 
of examples from this point of view, including Hilbert spaces, sets and 
relations, and stochastic maps. The local, quantum, and no-signalling 
models are characterized in these terms. 

1 Introduction 

A central theme in current work in quantum information and quantum founda- 
tions is to see quantum mechanics as occupying one point in a space of possible 
theories, and to use this perspective to understand the special features and 
properties which single it out, and the possibilities for alternative theories. 

Two formalisms which have been used in this context are operational theories 
|47[ 1401 1511 146) , and categorical quantum mechanics [51 [7J . 

• Operational theories allow general formulations of results in quantum 
foundations and quantum information QTl HH EH]- They also play a 
prominent role in current work on axiomatizations of quantum mechanics 

[nana Eg. 

• Categorical quantum mechanics enables a high-level approach to quantum 
information and quantum foundations, which can be presented in terms 
of string-diagram representations of structures in monoidal categories [7J . 
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This has proved very effective in providing a conceptually illuminating and 
technically powerful perspective on a range of topics, including quantum 
protocols [5J, entanglement [24] . measurement-based quantum computing 
[2"B] . no-cloning [T], and non-locality |22) . 

The aim of the present paper is to establish strong connections between 
these two formalisms. We shall begin by reviewing operational theories. We 
then show how a proper formulation of compound systems within the operational 
framework leads to a view of operational theories as representations of monoidal 
categories of a particular form. We call these operational representations. 

We then review some elements of categorical quantum mechanics, and show 
how monoidal dagger categories, equipped with a trace ideal, give rise to opera- 
tional representations. Thus there is a general passage from categorical quantum 
mechanics to operational theories. 

We go on to show how non-locality can be formulated at this level of gener- 
ality, and study a number of examples from this point of view, including Hilbert 
spaces, sets and relations, and stochastic maps. The local, quantum, and no- 
signalling models are characterized in these terms. 

We shall assume some familiarity with the linear-algebraic formalism of 
quantum mechanics, and with the first notions of category theory. To make 
the paper reasonably self-contained, we include an appendix which reviews the 
basic definitions of monoidal categories, functors and natural transformations. 

We also include another appendix which proves a number of technical results 
on trace ideals. These are mathematically interesting, but would break up the 
flow of ideas in the main body of the paper. 

2 Why operational theories? 

Before proceeding to a formal description of operational theories, it may be 
useful to discuss the motivation for studying them. 

As we see it, operational theories have the following attractions: 

• Firstly, they focus on the empirical content of theories, and the means 
by which we can gain knowledge of the microphysical world. Any viable 
theory must account for this content. 

• By focussing on this empirical and observational content, operational theo- 
ries allow meaningful results to be formulated and proved about the 'space 
of theories' as a whole. At a stage in the development of physics where the 
next step is far from clear, this is a useful perspective, which may prove 
useful in finding 'deeper' theories. 

• Indeed, the operational framework has proved fruitful as a basis for general 
results, e.g. on the information processing capabilities of theories under 
various assumptions [TTJ [T^l [TU] ; and provides the setting for recent work 
on axiomatic reconstructions of quantum mechanics [3S1 Qj|l HE] . 
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On the debit side, operational theories attract criticism on philosophical 
grounds. They are seen as linked to an 'instrumentalist' or 'epistemic' view of 
physics, as opposed to a 'realistic' approach. From our perspective, the fact 
that we study operational theories does not indicate any such philosophical 
commitment. Rather, they are pragmatically useful for the reasons already 
mentioned, and can be seen as expressing some irreducible minimum of empirical 
content, which will have to be accounted for by any presumptive 'deeper' theory. 

3 Operational theories formalized 

An operational theory is formulated in terms of directly accessible 'operations', 
which can be performed e.g. in a laboratory. We assume there are several 
different types of system, A, B, C, etc. For each system type A, the theory 
specifies the following: 

• A set of preparations Pa which produce systems of that type. 

• A set Ta of transformations which may be performed on systems of type 
A. More generally, we can consider transformations Ta,b which can be 
performed on systems of type A to produce systems of type B. 

• A set of measurements Ma which can be performed on systems of that 
type. 

Each measurement has a set of possible outcomes. In this paper, we shall only 
consider 'finite-dimensional' theories, or parts of theories. This means that 
each measurement has only finitely many possible outcomes. For convenience, 
we shall assume a fixed infinite set of outcomes O, which will apply to all 
measurements. Any measurement with a finite set of outcomes O'CO can be 
represented using O, where those outcomes outside O' have zero probability of 
occurring. 

The empirical predictions of the theory are given by its evaluation rule, 
which is a function 

v A : Pa x M A x O -> [0, 1] 

which assigns a probability ua(p, m, o) to the event that a system of type A, 
prepared by p, yields outcome o when measurement m is performed on it. 

For each choice of p and m, the function va(p,tti,—) defines a probability 
distribution on outcomes. We shall use the function 

d,A ■ Pa x Ma — > T> <1a (p, m) : o i-> va(p, m, o) 
where V is the set of probability distributions of finite support on O. 

3.1 Compound systems 

An important additional ingredient is to give an account of compound systems, 
i.e. putting systems, possibly space-like separated, together. 
This leads to the following additional requirements. 
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• For each pair of system types A, B, a compound system type AB. 

• Ways of combining preparations, measurements, etc. on A and B to yield 
corresponding operations on the compound system AB. 

Moreover, these operations should be subject to axioms yielding a coherent 
mathematical structure on these notions. 

Rather than trying to develop such 'meta-operations' and axioms from first 
principles, we see the essential elements as provided by monoidal categories, 
which have been developed extensively as a setting for quantum mechanics and 
quantum information in the categorical quantum mechanics programme [6l [7] . 

We shall therefore proceed by giving a precise formulation of operational 
theories with compound system structure as a certain class of representations 
of monoidal categories, which we call operational representations. 

3.2 Operational representations: concrete description 

Before giving the 'official' definition of operational representation, which is 
mathematically elegant but a little abstract, we shall give a more concrete ac- 
count, which shows the naturalness of the ideas, and also indicates why guidance 
from category theory is helpful in finding the right structural axioms. 

For each system type A, we can gather the relevant data provided by an 
operational theory into a single structure 

{P A ,M A ,d A : P A x M A -*V). 

This immediately suggests the notion of Chu space [14j [20] , which has received 
quite extensive development 53 , and was applied to the modelling of physical 
systems in [2] . Indeed, it can be seen as a generalization of the notion of model of 
a physical system proposed by Mackey in his influential work on the foundations 
of quantum mechanics |47j . 

There is a natural equivalence relation on preparations: p is equivalent to 
p', where p,p' € P Al if for all m € M A : 

d A (p,m) = d A (p',m). 

This is exactly the notion of extensional equivalence in Chu spaces [2] . We can 
regard states operationally as equivalence classes of preparations [50] . 

In an entirely symmetric fashion, there is an equivalence relation on mea- 
surements. We define m to be equivalent to to', where m,mf £ M A , if for all 
P£P A : 

d A (p,m) = d A (p,m'). 

We can regard observables operationally as equivalence classes of measurements. 

Quotienting an operational system {P A , M A ,d A ) by these equivalences cor- 
responds to the biextensional collapse of a Chu space [2]. 

Having identified operational systems with Chu spaces, we now turn to mor- 
phisms. A transformation in T A b induces a map /* : P A — > Pb- That is, 
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preparing a system of type A according to preparation procedure p, and then 
subjecting it to a transformation procedure t resulting in a system of type B, is 
itself a procedure for preparing a system of type B. 

Such a transformation can also be seen as a procedure for converting mea- 
surements of type B into measurements of type A: given a measurement m G 
M B , to apply it to a state prepared by p € Pa, we apply the transformation 
t to obtain a preparation of type B, to which m can be applied. Thus we can 
also associate a map /* : M B — > M A to the transformation t. The formal rela- 
tionship that links the two maps /» and /* is that, whether we measure f*(p) 
with to, or p with /*(to), we should observe the same probability distribution 
on outcomes: 

d B (f*(p),m) = d A (p,f*(m)). (1) 

This can be seen as an abstract form of the relationship between the Schrodinger 
and Heisenberg 'pictures' of quantum dynamics. 

The equation ([T]) says exactly that the pair of maps (/*,/*) defines a mor- 
phism of Chu spaces 

(/*,/*): (P A ,M A ,d A ) -> {P B ,M B ,d B ). 

Thus we see that in an entirely natural way, we can associate an operational 
theory with a sub-category of Chu spaces, more precisely of Chu(Set,2?) [S3]. 
This sub-category will not in general be full, since not every Chu morphism will 
arise from a transformation in the theory. 

However, this does not yet provide an account of compound systems. While 
Chu spaces have a standard monoidal structure, and indeed form ^-autonomous 
categories [20], we should not in general expect that operational theories will 
give rise to monoidal sub-categories of Chu spaces. Rather, we should see the 
notion of compound system as an important degree of freedom, which is to be 
specified by the theory. 

Thus given operational systems A = (P A ,M A ,d A ) and B = (P B ,M B ,d B ), 
we should be able to form a system A® B — (Pa®b, M A ® B , d A ® B ). 

What general properties should such a notion satisfy? One important re- 
quirement, which appears in one form or another in the various formulations of 
operational theories, is to have an inclusion of pure tensors. This is given by 
maps 

l a b : P A x P B -> P A » B , i% B '■ M A x M B -> M A(g>B - 

For readability, we shall write p (g> p' rather than c A B (p,p'), and similarly for 
measurements . 

The fundamental property which this inclusion must satisfy relates to the 
evaluation. For all p € Pa, p' £ Pb, to € M A , m' € M B , we must have: 

d At g, B {p<g>p' 1 m® to') = d A {p,m) ■ d B {p' ,m'). (2) 

This expresses the probabilistic independence of pure tensors. Conceptually, 
pure tensors arise by preparing states or performing measurements indepen- 
dently on subsystems. 
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In addition, there are a number of coherence conditions which are needed 
to get a mathematically robust notion. Rather than writing these down in an 
ad hoc fashion, we shall now turn to a more systematic way of defining the 
categorical structure of operational theories, in which these conditions arise 
naturally from standard notions. 

3.3 Operational representations: functorial formulation 

We shall now take a different view, in which the structure of an operational 
theory arises from a symmetric monoidal category, which we think of as a process 
category. The operational theory will amount to a certain form of representation 
of this process category. The receiving category for the representation will be 
(Set, x, 1), viewed as a symmetric monoidal category. 

Given a symmetric monoidal category C, an operational representation of 
C is specified by the following data: 

• A symmetric monoidal sub-category Ct of C. This will usually have the 
same objects as C, and only those morphisms which correspond to admis- 
sible transformations. 

• A symmetric monoidal functor P : C t — > Set which represents, for each 
object A of C t , viewed as a type of system, the corresponding set of 
preparations or states. 

• A contravariant symmetric monoidal functor M : C op — > Set which for 
each A represents the measurements on A. Note that C° p is a symmetric 
monoidal category. 

• A dinatural symmetric monoidal transformation 

d:PxM4K„ 

which gives the evaluation rule of the theory. Here K-p is the constant 
functor valued at T>. Note that a constant symmetric monoidal functor 
valued at a set M is just a commutative monoid (M, •, I) in Set. We take 
V to be a commutative monoid under pointwise multiplication. 

We shall assume that the functors P, M are embeddings, i.e. injective on objects 
and faithful. 

Let us now unpack this definition. 

• The general point of view is that the structure of the operational theory 
is controlled by the 'abstract' category C. The types of the theory are the 
objects of C t . 

• Rather than a single set of preparations, we have a variable set P, which 
for each type A gives us a set P^- Moreover, this acts functorially on 
the admissible transformations /: A — > B in C t to produce functions 
/* : Pa — > Pb, where /* := P(/). Thus these functions take preparations 
on A to preparations on B, as already discussed. 
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• Similarly, the functor M specifies a variable set of measurements for 
each system type A. The contravariant action of this functor is again as 
expected from our previous discussion. 

The first new ingredient which picks up the issue of monoidal structure is 
that P and M are required to be monoidal functors. The fact that P and M are 
monoidal means that there are natural transformations 



l a,b ■ 



Pa x P, 



L A.B ■ 



\ A X 



A®B- 



i.e. inclusions of pure tensors. Naturality means that the diagrams 



Pa x P B 
Pa- x P B < 



Pa 

Pa'0B> 



M A xM B 

/*X9* 

IVU' x M B > 



(/»<?)* 



commute. The coherence conditions for monoidal natural transformations com- 
plete the required properties of pure tensors. 

The dinatural transformation cU : Pa x — > T> represents the evaluation 
function. Dinaturality says that for each admissible transformation / : A — > B: 

Pb x 





Pa x 

Thus we see that dinaturality is exactly the Chu morphism condition ([1]). 
Monoidality of d is the equation ©. 



3.4 Operational categories 

If we are given an operational representation (C, C t , P, M,d) we can construct 
from this a single category, recovering the picture given in Section 13.21 

For each object A of C, we have the Chu space (Pai Ma, d^)- By dinaturality 
of d , each morphism / : A — » B gives rise to a Chu morphism 

(/.,/*): (PA,MA,d A )^(PB,MB,d B ). 

By functoriality of P and M, we obtain a sub-category of Chu spaces. 

Moreover, since P and M are embeddings, we can push the symmetric 
monoidal structure on C forward to this sub-category: 

P A ®P B := P a ®b, Ma®M b := M A ® B , f*®fi := (f®f%, f*®f := (/»/')*• 
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Thus we obtain a symmetric monoidal category, whose underlying category 
is a sub-category of Chu spaces. We call this the operational category arising 
from the operational representation. 

3.5 Generalized representations 

The structural properties of operational representations and categories are inde- 
pendent of the particular choice of the monoid T> used in specifying the dinatural 
transformation d. 

We shall define a generalized operational representation with weights W, 
where (W, •, 1) is a commutative monoid with a zero element, to be a tuple 
(C, C t , P, M, d), where d now has the form 

d: Px K w 

and Kyy is the constant symmetric monoidal functor valued at W. This yields 
the definition of operational representation given previously when W = T>. 

We now have a general scheme for representing symmetric monoidal cate- 
gories as operational categories. So far, however, we have no examples. We 
shall now show how monoidal dagger categories give rise to operational rep- 
resentations in a canonical fashion, following the ideas of categorical quantum 
mechanics [7]. 

4 Monoidal dagger categories 

Monoidal dagger categories are the basic structures used in categorical quantum 
mechanics [7]. We shall briefly review the definitions, and give a number of 
examples. 

A dagger category is a category C equipped with an identity-on-objects, 
contravariant, strictly involutive functor. Concretely, for each arrow f:A—*B, 
there is an arrow f* : B — > A, and this assignment satisfies: 

i f = i, (g°fy = P°g\ /" = /• 

We define an arrow / : A — >• B in a dagger category to be a dagger-isomorphism 
if: 

/ f °/=U /o/t = l B . 

A symmetric monoidal dagger category is a dagger category with a symmetric 
monoidal structure (C, ®, /, A, p, a, a) such that 

(/®<?) t = / + ®S t 

and moreover the natural isomorphisms A, p, a, a are componentwise dagger- 
isos. 
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Examples 



• The category Hilb of separable Hilbert spaces and bounded linear maps, 
and its (full) sub-category FHilb of finite-dimensional Hilbert spaces. 
Here the dagger is the adjoint, and the tensor product has its standard in- 
terpretation for Hilbert spaces. More generally, any symmetric monoidal 
C*-category is an example [321 This includes categories of (right) 
Hilbert C*-modules, which are Hilbert spaces whose inner product takes 
values in an arbitrary C*-algebra instead of C. 

• The category Rel of sets and relations. Here the dagger is relational 
converse, while the monoidal structure is given by the cartesian product. 
This generalizes to relations valued in a commutative quantale |54j . and 
to the category of relations for any regular category [T5] . Small categories 
as objects and profunctors as morphisms behave very similarly to Rel, 
even though they only form a bicategory |16j . 

• A common generalization of FHilb and FRel, the category of finite sets 
and relations, is obtained by forming the category FMat(S), where S is a 
commutative semiring with involution. FMat(S) has finite sets as objects, 
and maps X x Y — > S as morphisms, which we think of as l X times Y 
matrices'. Composition is by matrix multiplication, while the dagger is 
conjugate transpose, where conjugation of a matrix means elementwise 
application of the involution on S. The tensor product of X and Y is 
given by X x Y, with the action on matrices given by componentwise 
multiplication. (This corresponds to the 'Kronecker product' of matrices). 
If we take S = C, this yields a category equivalent to FHilb, while if we 
take S to be the Boolean semiring {0, 1} (with trivial involution), we get 
FRel. 

• An infinitary generalization of FMat(C) is given by LMat. This category 
has arbitrary sets as objects, and as morphisms matrices M : X x Y — » C 
such that for each x £ X, the family {M(x, y)} y£ y is ^2-summable; and for 
each y £ Y, the family {M(x, y)} x ex is ^2-summable. Hilb is equivalent 
to a (non-full) sub-category of LMat. 

• If C and D are symmetric monoidal dagger categories, then so is the cat- 
egory [C, D] of functors F: C — > D that preserve the dagger. Morphisms 
are natural transformations. This accounts for several interesting models. 
For example, setting D = FHilb and letting C be a group, we obtain the 
category of unitary representations. Any topological or conformal quan- 
tum field theory is a sub-category of the case where D = FHilb and C is 
the category of cobordisms [HI [5J [S3] . Letting C be the discrete category 
N, and letting D be either FHilb or FRel, we recover FMat(D(J, /)). 

The doubling construction All of the above examples are variations on the 
theme of matrix categories. Indeed, it seems hard to find natural examples 
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which are not of this form. However, there is a construction which produces a 
symmetric monoidal dagger category from any symmetric monoidal category. 
Although the construction is formal, it is interesting in our context since it can 
be seen as a form of quantization; it converts classical process categories into a 
form in which quantum constructions are meaningful. 

Given a category C, we define a dagger category C±5 as follows. The objects 
are the same as those of C, and a morphism (/, g): A —> B is a pair of di- 
morphisms f:A—>B,g:B—>A. Composition is defined componentwise; 
while (f,gy — (g, /)• This is in fact the object part of the right adjoint to the 
evident forgetful functor DagCat — > Cat; see [371 3.1.17]. Thus for each dagger 
category C, there is a dagger functor 77c : C — > Ct^ which is the identity on 
objects, and sends / to (/, /t). This has the universal property with respect to 
dagger functors C — > D*^ for categories D. 

This cofree construction of a dagger category lifts to the level of symmet- 
ric monoidal categories. If C is a symmetric monoidal category, then is a 
symmetric monoidal dagger category, with the monoidal structure defined com- 
ponentwise: thus (/, g) (£> (h, k) :— (/ <£> h, g <E) k). Note in particular that the 
structural isos in C turn into dagger isos in C<=;. 



4.1 Additional structure 

We shall require two further structural ingredients. The first is zero morphisms: 
for each pair of objects A, B, a morphism Oa,b : A — > B such that, for all 
f:C—>A and g: B -> D, 

0A,B°f = 0c,B, 9°0a,B = 0a,D- 

Note that if zero morphisms exist, they are unique. 

In the context of symmetric monoidal dagger categories, we further require 
that 

/®0 = = 0®g, f = 0. 



Examples All the examples of symmetric monoidal dagger categories given 
above have zero morphisms in an evident fashion. Functor categories have com- 
ponentwise zero morphisms. Zero morphisms in Ct> are pairs of zero morphisms 
in C. For more examples, see [38] . 

The final ingredient we shall require is a trace ideal in the sense of [4]0 
Firstly, we recall that in any monoidal category, the scalars, i.e. the endomor- 
phisms of the tensor unit 7, form a commutative monoid |43j . 

An endomorphism ideal in a symmetric monoidal category C is specified by 
a set 1(A) C End(A) for each object A, where End(A) = C(A, A) is the set of 
endomorphisms on A. This is subject to the following closure conditions: 

g: A-> B,f eX{A),h: B -> A g o / o h G X(B) 

1 Strictly speaking, we are defining the more restricted notion of global trace of an endo- 
morphism, rather than a parameterized trace as in [4]. This restricted notion is all we shall 
need. 
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/ G 1(A) ,gel(B) f®g€l(A®B) 7 1(1) = End (J) 

61(A). 

If C is a dagger category, I is a dagger endomorphism ideal when additionally 

fel(A) => /t 6 I(A), 

but we will also call these endomorphism ideals for short. A trace ideal is an 
endomorphism ideal X, together with a function 

Tr A : 1(A) -> End(J) 

for each object A, subject to the following axioms: 

Tr A (g o /) = Tr B (/ o g) (f : A -> B,g: B -> A, g o / G J(A), / o 5 G 1(B)) 

TW»(/ <g> g) = Tr A (f)Tr B (g), Trj(s) = s. 
A dagger trace ideal additionally satisfies 

Tr A (/t) = Tr A (/)t, 

but we will also call these trace ideal for short. We call a morphism / G 1(A) 
trace class. 

Examples All of the examples given above have trace ideals. In the case of 
finite matrices, the usual matrix trace is a total operation. In the case of Hilb, 
we interpret trace class in the standard sense for Hilbert spaces, and similarly for 
LMat. Through the GNS-embedding Proposition 1.14], this also provides 
a trace ideal for any C*-category. 

In the case of relations, the summation over the diagonal becomes a supre- 
mum in a complete semilattice, which is always defined. 

Any symmetric monoidal dagger sub-category of [C,D] inherits endomor- 
phism ideals and zero morphisms from D componentwise, and has a trace 
function Tr(a) = V A Ti(a A ) as soon as T)(I,I) has an operation V satisfy- 
ing \Z a s a = (M a s a)K \/ a s = s, and (\/ A s A t A ) = (\f A s A )(\f A t A ), where A 
ranges over the objects of C. This is the case when C is a finite group, as well 
as for topological quantum field theories. 

The doubling construction turns trace ideals into dagger trace ideals. For 
(f,g): A -> A, define (f,g) G 1(A) if and only if / e 1(A) and g G 1(A), 
and Tr A (f,g) — (Tr A (f),Tr A (g)). Thus if C is a symmetric monoidal category 
with zero morphisms and a trace ideal, C<^ is a dagger category with the same 
structure. 

In Appendix B, we prove a number of results about trace ideals: 

• We characterize when trace ideals exist, and to what extent they are 
unique. 
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• We show that we really need to restrict to ideals to consider traces: the 
category of Hilbert spaces does not support a trace on all morphisms. 

• As a corollary, we derive that dual objects in the category of Hilbert spaces 
are necessarily finite-dimensional. 

• Finally, we prove in some detail that the category of separable Hilbert 
spaces indeed has a trace ideal; the details turn out to be quite subtle. 

This material would have unduly interrupted the main flow of the paper, 
but is of mathematical interest in its own right. 

5 From categorical quantum mechanics to oper- 
ational categories 

Let C be a symmetric monoidal dagger category with zero morphisms and a 
trace ideal. We shall show that C gives rise to an operational representation 
and operational category in a canonical fashion, directly inspired by quantum 
mechanics. 

5.1 Transformations 

We take C t to be the sub-category with the same objects as C, and with dagger- 
isomorphisms as arrows. This is a groupoid, i.e. all morphisms are invertiblc. 
It is easily seen to be a monoidal dagger sub-category of C. 

5.2 States 

A morphism / e End(A) in a dagger category is positive if for some g : A — > B, 
f = o g. We define a state on A to be a positive morphism / £ End(A) which 
is trace class, and such that Tr^/) = 1. We write Pa for the set of states on 
A. 

In Hilb, this definition yields exactly the standard notion of density operator 
as used in quantum mechanics. 

Pure states can also be defined in this setting. An arrow tp : I — > A has unit 
norm if tp^ o tp = 1. Given such an arrow, tp o tp^ e P^. Indeed, this arrow is 
clearly positive, and 

Tr A o V f ) = Tr/O^ o tp) = Tr/(1) = 1 

using our assumption on tp and the axioms for the trace. 

Given a dagger isomorphism /: A — > B in C, the function /* : — > P B is 
defined by 

/*: si-^/oso/t. 

Functoriality holds, since 

o /*(s) = go (f o s o f) o 5 f = (go f) o so (g o /)* = (jo/),(s). 
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Inclusion of pure tensors is given by 

i P A B : (s,t) i->- s ® t. 
It is straightforward to check the coherence conditions. 

5.3 Measurements 

A dagger idempotent, or projector, on A is an arrow P € End(A) such that 

P 2 = P, P = P\ 
A family of endomorphisms on ^4 is: 

• Pairwise disjoint if /j o fj = 0, i ^ j; 

• Jointly monic if for all g, ft: B — > A: 

[Viel.fi°g = fioh] .g = ft. 

A projective measurement on ^4 with finite set of outcomes O' C O is a family of 
dagger idempotents {P }oeO' on A which is pairwise disjoint and jointly monic. 
We take Ma to be the set of projective measurements on A. 

The functorial action of the measurement functor on dagger isomorphisms 
f:A^B'mCis defined by 

t{Po) = P°Po°f. 

It is easily verified that /* preserves disjointness and joint monicity of families 
of projectors, and hence carries projective measurements to projective measure- 
ments. Functoriality is also easily verified. 

Inclusion of tensors is defined pointwise on projectors: 

l p ab : (Po,Po>) P ®Po>. 

Note that the combined measurement will have a finite set of outcomes which, 
perhaps with some relabelling, can be regarded as a subset of O. 

5.4 Evaluation 

The transformation d is defined as follows, where s e Pa, and m = {P }oeO' € 
Ma: 

Tr A (soP ), oeO' 
0, otherwise. 



d A (s,m)(o) :-- 



Note that d is valued in the commutative monoid of scalars W := End(7)°. By 
the assumption of zero morphisms, this monoid has a zero element. 

The dinaturality of this transformation, i.e. the Chu morphism condition, is 
just: 

Tr B (/ o a o /t o P ) = Tta(s ° P ° P Q o /). 
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The monoidality of d is verified as follows: 



cU®b(s <& s',m ® m')(o, o') = Tr A ® B (s ®s'oP a ® P a ,) 



= Tt a ®b(soP ® s'oP ,) 
= Tr A (soP )Tr B (s' oP a ,) 
= d A (s,m)(o) ■ d B (s',m')(o') 



5.5 The canonical operational representation 

We collect the constructions described in this section together. Given a sym- 
metric monoidal dagger category C with zero morphisms and a trace ideal, we 
have defined a sub-category C t , monoidal functors P and M, and a dinatural 
transformation d. 

Proposition 3. The tuple (C, C t , P, M, d) is an operational representation with 
weights W. We call this the canonical operational representation of C. The 
corresponding operational category is the canonical operational category for C . 



We say that the canonical representation is distributional if the monoid of 
scalars End(7) has an addition making it a commutative semiring, and for each 
state s G P A and measurement m € M^: 



We say that it is probabilistic if moreover the image of d embeds into the semiring 
of non-negative reals. 

6 Examples of operational categories 

We shall now examine the operational categories arising from various examples 
of symmetric monoidal dagger categories. 

6.1 Hilbert spaces 

The definitions of states, measurements and evaluation are directly inspired by 
those used in the standard Hilbert-space formulation of quantum mechanics. 
Thus it is immediate that the states in the canonical representation for Hilb 
are the density matrices, while the dagger-isomorphisms are the unitary trans- 
formations. 

For measurements, we have the following result. 

Proposition 5. Measurements in Hilb have exactly their standard meaning. 
More precisely, observables with finite discrete spectra correspond exactly to the 
interpretation in Hilb of the abstract notion of measurements as defined in 
Section \5.3\ for dagger categories. 



□ 




(4) 



oeo 
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Proof. We think of the outcomes as labelling the eigenvalues of the observable; 
then the family {P }oeO' should correspond to the spectral decomposition of 
the observable. Clearly, dagger idempotents correspond exactly to projectors 
in Hilb, and so does the notion of a pairwise disjoint family of projectors. 
It remains to show that the joint monicity condition captures the fact that a 
pairwise disjoint family of projectors {Pi}i e j yields a resolution of the identity, 
i.e. 

iei 

Indeed, if J2iei ^» = ^ A anc ^ Pi ° g = Pi ° h for all i, then 

g = 1a o g = Pi)°g = y^ J Pi°g = y^ J Pi°h= Pi)oh=l A oh = h. 
iei iei iei iei 

For the converse, suppose that Yliel ^* ^ This implies that for some non- 
zero vector ip, Pi(ip) =0 for all i. Then for / : C —> A given by 1 H > ip, we have 
Pi ° f = Pi °0 for all i, so the family is not jointly monic. □ 

Finally, the definition of d matches the standard statistical algorithm of 
quantum mechanics. Thus we obtain the standard interpretations of states, 
transformations, (projective) measurements, and probabilities of measurement 
outcomes. 

The operational category arising from Hilb is of course probabilistic. 

The same analysis holds for C*-categories through their GNS-construction, 
and for subcategories of [C,Hilb] such as topological quantum field theories. 
States and measurements in such categories are just natural transformations 
whose components are states or measurements respectively. Because the tensor 
unit in such categories is the constant functor Ki, they have the same scalars 
as Hilb. Therefore the induced operational categories are probabilistic. 

6.2 Relations 

We shall now give a general analysis of the operational representation for locale- 
valued relations. This level of generality will be useful when we go on look at 
non-locality in operational categories. 

We recall that a locale |41j (also known as a frame or complete Heyting 
algebra) is a complete lattice f2 such that the following distributive law holds: 

a A \f b, = \f aAb,. 
iei iei 

The category Rel(5!) has sets as objects, while the morphisms R : X — > Y are 
il-valued relations (or matrices) R : X x Y — s- fl. We write [ccRy] = lu for 
R{x,y) = u. Composition is relational composition (or matrix multiplication) 
evaluated in IX If R: X -> Y and S: Y -> Z, then: 

{x(SoR)z] := \JlxRyjAlySzl 

y& 
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S(x) if x = y, 
_L if x 7^ y. 



Clearly, Rel is the special case that is the Boolean semiring {_L, T}, where we 
identify _L, the bottom element of the lattice, with 0, and T, the top element, 
with 1. Note that the full subcategory FRel(Q) of finite sets is identical to 
FMat(fi), where we regard 11 as a semiring with idempotent addition and mul- 
tiplication. Indeed, in the finite case, completeness of il need not be assumed, 
and we are simply in the case of matrices over idempotent semirings. 

We shall take the tensor unit in Rel(fi) to be I = {•}. 

By an f2-subset of a set X, we mean a function X — > il. Any family {Si} of 
r2-subsets of X has a 'union' \J t Si given by x i-> \J i Si(x), and an 'intersection' 
/\ t Si given by x h-> /\ i Si(x). In particular, we write T x for the fi-subset of X 
given by x i-> T, and J-x for the f2-subset of X given by x H> _L. Given a set 
X, we say that a family {Si}i e i of O-subsets of X is a disjoint cover of X if: 

S i AS j =± x V 5 '' = T ^- 

Given a f2-subset 5* of X, we define a fi- relation As : X — > X by 

Note that 

A s oA T = J_ M ^ SAT = J_ X , \/A Si = U <J=* V^ = T ^- ( 6 ) 

Proposition 7. Projective measurements on X in Rel(O) consist of families 
of relations {As^i^i, where {Si}i<=i is a disjoint cover of X . 

Proof. Clearly any family of relations of this form is a projective measurement. 
For the converse, suppose we have a projective measurement {Pi}i e j on X. 
The fact that Pi is a projector in Rel(f2) means that [xPiy] = [yP^x]] and 
[xPz] = \/ y lxPy} A [yPzJ, which implies that [xPjx]] > [xP^y]. Suppose for 
a contradiction that [xPyJ = lu > _L where x ^ y. Define R,S: I — > X 
by {•Rxj = lu = fiSyl and l»Rzj = JL = {•Szj for other z. Then [.(P o 
RM = A {zPiXj = to A [xPx] = lu since [xPx] > \yP % x\ = u, 

and also [»(P o S)xj = lu. Similarly [«(P o R)y\ = lu = [«(P o S)yj, and 
{•(Pi o R)zj = _L = [»(P o S)zj for other z. Hence P o R = P t o 5. Moreover, 
PjoR = PjoS = l. for any j ^ i, by disjointness of the family, since e.g. 
_L < \xPjz\ < [xP,x] implies P.oF^l. Thus P k oR = P k oS for all fee/, 
contradicting joint monicity. Hence Pi must have the form Pi — Ag ; for some 
Si C X. The fact that the family {Si}i^i is a disjoint cover of X now follows 
from ([6]). □ 

Next we analyze states in Rel(f2). Firstly, we give an explicit description of 
the trace. If R : X —> X is an 17- valued relation, 



.Tr x (P).]=Y[xPa 



1G 



Thus the trace can be viewed as a predicate on endo-relations, which is satisfied 
to the extent that the relation has a 'fixpoint', i.e. a reflexive element. 

Note that f2-valued relations R : I — > X of unit norm correspond to fi-subsets 
S of X satisfying 

V%o = t. 

X 

The corresponding pure state is Ps, defined by [xPsj/J = S(x) A S(y). 
We say that states s, t on X are equivalent if for all fi-subsets S of X: 

Tr x (so A s ) = Tr x (to A s ). 

Proposition 8. Every state in Rel(f2) is equivalent to a pure state. 

Proof. If s is a state on X, then it satisfies T = V ^JxsxJ, and for some relation 
R, 

{xsyj = \J{xRzj A \yRz\. 

z 

Define an i7-subset S = dom(s) of X by x n- [xsxj. We claim that s is 
equivalent to Ps- Indeed, for any f2-subset T of X, 

Tr x (s o A T ) = \/[x(s o A T )x] 

X 

= \J\xA T y\ A fysxj 
= \/ T ( x ) A l xsx l 

X 

= \/T(x)AS(x) 

X 

= \JlyPs4 A IxAryj 

= Tr x (P s oA T ). □ 

Finally, we consider evaluation. The scalars in Rel(fi) can be identified with 
the locale £2. Because states correspond to fi-subsets S satisfying S(x) = T, 
and measurements to disjoint covers, we see that equation (|4]) is satisfied. Thus 
we have the following result. 

Proposition 9. The operational cateqory arisinq from Rel(f2) is distributional. 
Proof. Let Ag be a state, and m be a measurement given by a disjoint cover 
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{S } of X. Then 

^ (Ag , to) (o) = \J Trx (Ag o A 5o ) 

O O 

= \/ [zA s2 /J A [yA So x] 
= \/ S(x)AS (x) 

o,x 

= yS(x)A(\JS (x)) 

X o 

=ys(x) 

X 

= T. □ 

Discussion These results highlight two important differences between Rel(f2) 
and Hilb as operational categories. In Hilb, every projector can appear as 
part of a projective measurement, while in Rel(f2) the collective conditions of 
disjointness and joint monicity impose the constraint that projectors have to be 
sub-identities As- Moreover, in Rel(f2) the distinction between superpositions 
of pure states, and convex combinations to form mixed states, is lost, so that 
every state is equivalent to a pure one. The relevance of this will become 
apparent when we discuss non- locality in Rel(Sl) in Section |9~^1 

7 Classical operational categories 

The construction of operational representations on monoidal dagger categories is 
directly inspired by quantum mechanics. However, operational theories should 
also include classical physics — or its discrete operational residue. Our notion 
of operational representation is indeed broad enough for this, as we shall now 
show. 

The basic classical setting we shall consider is the category Stoch. The 
objects are finite sets, and the morphisms M : X — > Y are the X x y-matrices 
valued in [0,1] which are row-stochastic. Thus for each x G X, we have a 
probability distribution on Y. 

An alternative description of Stoch is as the Kleisli category for the monad 
of discrete probability distributions; see [39] . 

The monoidal structure is defined as for FMat(5). Note that Stoch is not 
closed under matrix transposition. Indeed, we have the following result. 

Proposition 10. There is no dagger structure on Stoch. 

Proof. Note that if a category C has a dagger structure, it is in particular self- 
dual, i.e. equivalent to C op . However, the one-element set is terminal but not 
initial in Stoch, which is thus not self-dual. □ 
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It follows that we cannot directly apply the construction of Section [5] One 
might consider using the formal doubling construction on Stoch to obtain a 
dagger symmetric monoidal category with a dagger trace ideal. But this would 
not yield the expected result; for example, the dagger would not be given by 
transpose of (bi-stochastic) matrices. However, it is easy to give a direct defini- 
tion of an operational representation, as follows. 

• The sub-category Stoch t is defined by restricting to the functions (deter- 
ministic transformations), represented as matrices by their characteristic 
maps. Thus if / : X —> Y is a function, for each x G X the corresponding 
probability distribution is Sf( x y 

• A state on X is a morphism I — > X in Stoch, or equivalently a proba- 
bility distribution on X. This is the classical notion of mixed state. The 
functorial action of states is described as follows. Given /: X — > Y, we 
define 

/*0)(y) = 

• A measurement on X is a function m: X — > O with finite image O' C O. 
This is just a discrete random variable. The functorial action on / : X —> Y 
is just 

TTi i — y vn o f. 

• The evaluation is defined by: 

d x (s,m)(o) = Y s ( x )- 

m(x)—o 

The following result is easily verified. 

Proposition 11. The above data specifies a probabilistic operational represen- 
tation of Stoch. □ 

Various generalizations of this construction are possible: 

• We can generalize to 'distributions' over an arbitrary commutative semir- 
ing, as in |39) . This will still yield a distributional operational representa- 
tion. 

• We can generalize to probability measures over general measure spaces. 
This amounts to using the Kleisli category of the Giry monad [33 . 

8 Non-locality in operational categories 

Having set up a general framework for operational categories, we shall now 
investigate an important foundational notion in this general setting; namely 
non- locality. 

Throughout this section, we fix a distributional operational representation 
(C, Ct, P, M, d) on a monoidal category C. 



19 



8.1 Empirical models 

We shall begin by showing how probability models of the form commonly studied 
in quantum information and quantum foundations can be interpreted in the 
corresponding operational category. In these models, there are n agents or 
sites, each of which has the choice of one of several measurement settings; and 
each measurement has a number of distinct outcomes. For each choice of a 
measurement setting by each of the agents, we have a probability distribution 
on the joint outcomes of the measurements. 

We shall associate objects A%,...,A n with the n sites. We define A := 
Ai <£)■■■ <E) A n . We fix a state s € Pa- For each combination of measurements 
(mi, . . . ,m n ), where mi € for i = 1, . . . ,n, we obtain the measurement 
m := mi ® • • • ® m n by inclusion of pure tensors. Now the probability of 
obtaining a joint outcome o := (o%, . . . , o n ) for m is given by 

p(o\m) := d A (s,m)(o). 

We can regard these models as observational 'windows' on the operational 
theory. They represent the directly accessible information predicted by the 
theory, and provide the empirical yardstick by which it is judged. 

8.2 Non-locality 

We now define what it means for an empirical model of the kind described in 
the previous sub-section to exhibit non-locality. We shall follow the traditional 
route of using hidden variables explicitly, although we could equivalently, and 
perhaps more elegantly, formulate non-locality in terms of the (non-) existence 
of a joint distribution [30J [5] . 

We are assuming a fixed distributional model, with a semiring of weights W. 
A W -distribution on a set X is a function d : X — > W of finite support, such 
that 

E = L 

A hidden- variable model for an empirical model is defined using a set A of hidden 
variables, with a fixed distribution (id For each A € A, the model specifies a 
distribution q x (o\m) on outcomes o for each choice of measurements m. The 
required condition for the hidden variable model to realize the empirical model 
p is that, for all m and o: 

p(o\m) = E<? A (°M- d M- 

AeA 

That is, we recover the empirical probabilities by averaging over the hidden 
variables. 

2 The assumption of a fixed distribution d is technically the condition of 'A-independence' 
[25]. 
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We say that the hidden- variable model is local if, for all A € A, m = 
(mi,...,m n ), and o := (oi,...,o„): 

n 

g A (o|m) = J|g A (0i|mi). 

i=l 

Here g A (oi|mi) is the marginal: 

9 A (0i|mi) = ^ <? A (o'|m')- 

We say that the empirical model p is iocaZ if it is realized by some local hidden- 
variable model; and non-local otherwise. 

Note that the definition of non-locality makes sense for any distributional 
operational category. Thus we can lift these ideas to the general level of oper- 
ational categories. We say that an operational category exhibits non-locality if 
it gives rise to a non-local empirical model. Ultimately, we have a criterion for 
ascribing non-locality to monoidal process categories themselves, relative to a 
given distributional operational representation. 

9 Examples of non-locality 

We shall now investigate non-locality in a number of examples. 
9.1 Hilbert spaces 

As expected, the operational category arising from Hilb, which is essentially 
the finite-dimensional part of standard quantum mechanics, does exhibit non- 
locality. 

As a standard example — essentially the one used by Bell in his original 
proof of Bell's theorem — consider the following table. 





(0,0) 


(1,0) 


(0,1) 


(1.1) 


(a, b) 


1/2 








1/2 


(a,b') 


3/8 


1/8 


1/8 


3/8 


(a',b) 


3/8 


1/8 


1/8 


3/8 


(a>,b>) 


1/8 


3/8 


3/8 


1/8 



It lists the probabilities that one of two outcomes (0 or 1) occurs when simul- 
taneously measured with one of two measurements at two sites (a or a' at the 
first site, and b or b' at the second). This table can be realized in quantum 
mechanics, e.g. by a Bell state, written in the Z basis as 

in) + in) 

subjected to spin measurements in the XF-plane of the Bloch sphere, at a 
relative angle of ir/3. 
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A standard argument (see e.g. [HI 03) shows that this table cannot be real- 
ized by a local hidden- variable model. 

The same reasoning applies to C*-categories and subcategories of [C, Hilb], 
taking the hidden variables componentwise. The constant functor valued e.g. at 
the model described above then still shows that such operational categories are 
non-local. 

9.2 Relations 

Suppose we are given an empirical model in the distributional operational cat- 
egory obtained from Rel(f2). The types are sets X%, . . . ,X n , there is a state 
s = As for a fi-subset S of X := JJ i Xi satisfying \] x Six) = T, and mea- 
surements rrii — {A S i} oe o' j where {SI} is a disjoint cover of JQ. For each 
combination of measurements m and outcomes o, we have: 



We shall now construct a local hidden-variable model which realizes this 
empirical model, using the elements of X as the hidden variables. We define the 
distribution d s on X as x i— ► S(x). Note that we are working over f2 (the locale 
of scalars in Rel(O)), so this is a well-defined distribution, which sums to 1. 

We define p x (o\m) = /\iSg (xt), so this hidden- variable model is local by 
construction. 

We must verify that this model agrees with the empirical model. This comes 
down to the following calculation for o € O': 



p(o\m) = \/ S(x) A Sl(x) = \/ S(x) A /\ S^x) = \J p x (o\m) A d s (x). 



We conclude from this that Rel(O), despite being a 'quantum- like' monoidal 
dagger-category, does not admit non-local behaviour. This stands in interesting 
counter-point to the fact that, as shown extensively in [3], relational models can 
be used to give 'logical' proofs of non-locality and contextuality, in the style of 
'Bell's theorem without inequalities' [33]. The key point is that these logical 
proofs are based on showing the non-existence of global sections compatible 
with a given empirical model; while here we are looking at empirical models 
generated by states in Rel(Q), which are exactly sets of global elements. 

The key feature of quantum mechanics, by contrast, is that quantum states 
under suitable measurements are able to realize families of probability distribu- 
tions which have no global sections. 

Bearing in mind that finite-dimensional quantum mechanics corresponds to 
the operational category arising from FMat(C), while FRel(O) is FMat(O), 
this shows that idempotence of the scalars implies that only local behaviour can 
be realized; thus non-locality can only arise in non-idempotent situations. 




X 



X 



X 
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9.3 Classical stochastic maps 

We now consider the case of classical stochastic maps, as discussed in Section 
This is in fact quite similar to the case for Rel. Given an empirical model 
realized by sets Xi, . . . , X n , a state s which is a probability distribution on 
X := Y[i Xi, and measurements m,i : Xi — > O, we again take the hidden variables 
to be the elements of X. We can write s as a convex combination 

x£X 

Note that fi is a probability distribution on X. We can define p x (o\m) := 
5 (m(xj). Clearly p x (o\m) = Y\ i p x (oi\rrii), so this hidden-variable model is 
local. 

It is straightforward to verify that the probabilities p(o\m) are recovered by 
averaging over the deterministic hidden variables. 

Thus we conclude, as expected, that Stoch does not exhibit non-locality. 

In fact, we can say more than this. We can calibrate the expressiveness of an 
operational theory in terms of which empirical models it realizes. We shall now 
show that Stoch realizes exactly those models which have local hidden-variable 
realizations. 

To see this, suppose we are given sets of measurements Mi, . . . , M n . We 
define M := |J i Mj, the disjoint union of these sets of measurements, and X := 
M . Thus elements of X simultaneously assign outcomes to all measurements. 
For each m = (mi , . . . , m n ) € J\ i Mi, we define a map rh : X — > O by 

m: 14 (x(mi), . . . , x(m n )). 

For each m, we get the probability distribution on outcomes given by 

d m : o4 s(x). 

m(x)—o 

This is the empirical model realized by the state x, viewed as a probability 
distribution on the hidden variables X; and as shown e.g. in [5], all local models 
are of this form. 

9.4 Signed Stochastic Maps 

We shall now consider a variant of Stoch which has much greater expressive 
power in terms of the empirical models it realizes. This is the category SStoch 
of signed stochastic maps; real matrices such that each row sums to 1. Thus for 
each input, there is a 'signed probability measure' on outputs, which may include 
'negative probabilities' [5D1 HS1 221 HH]- An operational representation can be 
defined for SStoch in the same fashion as for Stoch; it is still distributional. 

The following result can be extracted from [5] Theorem 5.9], using the same 
encoding of empirical models which we employed in the previous sub-section. 
The reader should refer to [5, Theorem 5.9] for the details, which are non-triviai. 
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Proposition 12. The class of empirical models which are realized by the op- 
erational category obtained from SStoch are exactly the no-signalling models; 
thus they properly contain the quantum models. 

This says that the operational category obtained from SStoch is more ex- 
pressive, in terms of the empirical models it realizes, than the canonical opera- 
tional category derived from Hilb, which corresponds to quantum mechanics. 

Example We consider the bipartite system with two measurements at each 
site, each with outcomes {0, 1}. Thus the disjoint union M of the two sets 
of measurements has four elements, and X = {0, 1} M has 16 elements. Now 
consider the following state: 

x := [1/2,0,0,0,-1/2,0,1/2,0,-1/2,1/2,0,0,1/2,0,0,0]. 

The distributions it generates for the various measurement combinations can be 
listed in the following table. 





(0,0) 


(1,0) 


(0,1) 


(1,1) 


(a,b) 


1/2 








1/2 


(a',b) 


1/2 








1/2 


(a,b') 


1/2 








1/2 


(a',b') 





1/2 


1/2 






This can be recognized as the Popescu-Rohrlich box |52j . which achieves super- 
quantum correlations. 

The state x can be obtained from the PR-box specification by solving a 
system of linear equations; see [5] for details. 

10 Final remarks 

This paper makes a first precise connection between monoidal categories, and 
the categorical quantum mechanics framework, on the one hand, and operational 
theories on the other. Clearly, this can be taken much further. We note a 
number of directions which it would be interesting to pursue. 

• We have used our framework of operational categories to study non- 
locality in a general setting. In particular, we have a clear definition of 
whether a model of categorical quantum mechanics exhibits non-locality 
or not, as explained at the end of Section |SJ As we saw, while Hilbert- 
space quantum mechanics does, the category of sets and relations, which 
forms a very useful 'foil' model for quantum mechanics in many respects 
[591 122] . does not. An important further direction is to apply a similar 
analysis to contextuality, which can be seen as a broader phenomenon, 
of which non- locality is a special case. In [5], a general setting is devel- 
oped allowing a unified treatment of contextuality and non-locality. We 
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would like to extend the present account to this setting, in which com- 
patibility of measurements is explicitly represented, leading to a natural 
sheaf-theoretic structure. 

• Such a development would also lead to a more satisfactory treatment of 
outcomes, in place of the somewhat clumsy device used in the present 
paper. 

• It would also be interesting to interpret some of the general results which 
have been proved for operational theories, relating e.g. to no-broadcasting 
[TT] . teleportation (T2], and information causality 10 , in our categorical 
framework, and ultimately to obtain such results for classes of monoidal 
process categories. 

• We would also like to examine the issue of axiomatization or 'reconstruc- 
tion' of quantum mechanics from the categorical point of view. 

• There are various constructions for turning a monoidal category of 'pure' 
states into one of 'mixed' states [SHI [23] ■ It would be interesting to re- 
late these constructions to our canonical operational categories. Similarly, 
there is a category embodying Spekkens' toy theory (5!B[3T]. It would be 
of interest to study the associated operational category. 

Regarding related work, we note that in [T3] , the structure of the concrete 
category of convex operational theories is investigated. 
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A First notions from category theory 

We shall review some basic notions from category theory. For more detailed 
background, see [9]. 

A category C has a collection of objects A, B,C, . . ., and arrows /, g,h, . . .. 
Each arrow has specified domain and codomain objects: notation is / : A — > B 
for an arrow / with domain A and codomain B. The collection of all arrows 
with domain A and codomain B is denoted as C(A, B). Given arrows / : A — > B 
and g: B — > C, we can form the composition g o / : A — > C. Composition is 
associative, and there are identity arrows 1a ■ A — > A for each object A, with 
/ o l A — /, l A o g = g, for every / : A — > B and g : C — » A. An arrow / : A — > B 
is called an iso (morphism) when / o / _1 = 1 B and / _1 o f = 1 A for some arrow 
/ _1 : B — > A. An arrow /: A — > B is split monic when g o / = 1^ for some 
g : B — > A, and it is split epic when / o g = 1 A for some g : B — > A; by abuse of 
notation, we will write g = / _1 in both cases. 
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If C is a category, we write C op for the opposite category, with the same 
objects as C, and arrows A — » B corresponding to arrows B — > A in C. 

If C and D are categories, a functor F : C — > D assigns an object FA of 
D to each object A of C; and an arrow Ff: FA —> FB of D to every arrow 
/: A — »• B of C. These assignments must preserve composition and identities: 
F(g o /) = F(g) o F(f), and F(1 A ) = 1 FA . 

Given functors F, G: C — ¥ D, a natural transformation t: F — » G is a family 
of arrows -F 1 ^ — >■ GA} indexed by the objects of C, such that, for every 
/: A — > B in C, the following naturality diagram commutes: 

FA-^GA 



Ff 



Gf 



FB ^GB 

ts 

A natural isomorphism is a natural transformation whose components are iso- 
morphisms. An equivalence of categories is a pair of functors F: C — > D and 
G: D — > C such that there are natural isomorphisms FoG = Id and GoF = lc- 
A symmetric monoidal category is a structure (C, <£>, 7, a, A, p, cr) where: 



C is a category; 

<gi:CxC— s-Cisa functor {tensor)] 
I is a distinguished object of C (unit); 

a, A, p, a are natural isomorphisms (structural isos) with components 
a A .B,c ■ A ® (B ® C) (A ® B) ® C 
\ A : I ® A ^ A p A : A® I ->• A 

o~ Aj b : A® B ^ B® A 
such that certain coherence diagrams commute. 

Products are a classical example of symmetric monoidal structure; the category 
is then called Cartesian. The symmetric monoidal structure can also support 
entanglement; the category is then called compact [7]. 

Let C and D be symmetric monoidal categories. A symmetric monoidal 
functor 

(F,e,m): C -> D 

comprises 

• a functor F: C — > D, 

• an arrow e : 2d - ► -F^c > 

• a natural transformation toa,b : g3 — > F(A <S> B), 
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subject to coherence conditions with the structural isomorphisms. The sym- 
metric monoidal functor is called strong when m is a natural isomorphism. 

Let (F, e, to), (G, e', m') : C — > D be symmetric monoidal functors. A monoidal 
natural transformation between them is a natural transformation t : F — > G such 
that the following diagrams commute. 




FA ® FB U> F(A <g> E) 

GA <g> G-B — ^ G(A <g> B) 



B Trace ideals 

This appendix further studies the notion of trace ideal, introduced in Section ^. 11 
It presents several technical results that are mathematically interesting, but 
would break up the flow of the main text. For example, we characterize when 
trace ideals exist, and to what extent they are unique. Also, we show that we 
really need to restrict to ideals to consider traces: the category of Hilbert spaces 
does not support a trace on all morphisms. As a conceptually satisfying corol- 
lary, we derive that dual objects in the category of Hilbert spaces are necessarily 
finite-dimensional. Finally, we prove in some detail that the category of separa- 
ble Hilbert spaces indeed has a trace ideal; this was claimed in Section 14. 1 L but 
the details are quite subtle. 



B.l Existence 

The question whether a category allows a trace ideal at all can be answered as 
follows. 

A subcategory D of C is called tracial when cndomorphisms in C factoring 
through D can only do so in a way unique up to isomorphism. More precisely: 
if ft : X -> Y, h : Y -> X, ft : X -> Y', ft : Y' -> X are morphisms of C, and 
Y and Y' are objects of D, and fa o f 1 = ft ° f[> then there is a morphism 
i : Y Y' in D that is either split monic or split epic, such that ft = i° ft and 

ft = ho%-\ 




The category C is called traceable when the full subcategory consisting of the 
monoidal unit / is tracial. Notice that traceability generalizes the fact, holding 
in any monoidal category, that the scalars are commutative. 
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Proposition 13. Any dagger monoidal tracial subcategory Do/C with a trace 
ideal induces a trace ideal 

2(X) = {f e C(X, X)\f = f 2 o h with h- X -+YJ 2 :Y ^ X andY in D, f x o f 2 e X D (^)} 
Tr(/)=Tr D (/ 1 o/ 2 ) 

on C. 

Proof. One directly checks that is an endomorphism ideal; in particular 

1(1) — T)(I,I) = C(I,I). Because D is tracial, Tr is well-defined. The axioms 
for the trace function are also readily verified. □ 

Theorem 14. A dagger monoidal category has a unique minimal trace ideal 

1(X) = {f:X->X\f factors through 1} 

Tr(/) = b o a, when f = a o b with a: I — > X and b: X — > / 

and hence has any trace ideal whatsoever, if and only if it is traceable. 

Proof. That the given data form a trace ideal follows from the previous propo- 
sition, because the full subcategory consisting of just the monoidal unit I is 
certainly (totally) traced. To see that this trace ideal is minimal, i.e. that any 
trace ideal must contain this one, follows from the first and third axioms of 
endomorphism ideal. □ 

As a consequence of the previous theorem, the evaluation of measurements 
on pure states is completely determined by the structure of the category, in- 
dependent of the trace ideal. If s = tf) o ^>t [ s a p Ure state on X, and {P } a 
measurement, then for every outcome o: 

Tr(s o P a ) = Tr(V> o^o P Q ) = Tr(^ f o P o V) = ^ o P o ?/>. 

Therefore, the only possible freedom the choice of a trace ideal brings comes 
out in behaviour on mixed states. 



B.2 Uniqueness 

We now consider uniqueness of trace ideals. The following proposition proves 
that trace ideals are a categorical invariant, in the sense that they are preserved 
under equivalence. A dagger monoidal equivalence is a pair of functors F : C — > 
D and G: D — > C that form an equivalence of categories, such that F(f jf ) = 
F(fy and G(f') = G(fy, and there are natural isomorphisms F(I) = /, 
G(I) = I, F(X <8> Y) = F(X) O F(Y) and G(X ® F) S G(A) ® G(F) that 
interact with the coherence isomorphisms in the appropriate way. 

Proposition 15. Trace ideals are preserved under dagger monoidal equivalence: 
if F : C > D andG: D — > C are strong monoidal functors that preserve daggers 
and form an equivalence of categories, and (I, Tr x ) is a trace ideal in C, then 

J(X) = G-\1{G{X))) - {g e D(A,A) | G( S ) e I(G(I))}, 

Tr^(.g)= J F(Tr5 (x) (G(. 9 ))), 
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form a trace ideal in D. 

Proof. First, observe that if / € I(X), and g: X —> F is an isomorphism 
with inverse h, then Tr(/) = Tr(gfh). Then, to verify that J is an endomor- 
phism ideal, the first requirement follows from functoriality of G; the second 
from the fact that G is monoidal; and the third from fullness of G together 
with monoidality of G. It is a dagger endomorphism ideal because G preserves 
daggers. Verifying that Tr" 7 satisfies the requirements is completely analogous, 
except that the last condition additionally uses F(G(s)) = s. □ 

However, trace ideals need not be unique. In fact, there may even be more 
than one trace function making a fixed endomorphism ideal into a trace ideal, 
as the following example shows. 

Example 16. A tracial state on a C*-algebra A is a linear map t: A — > C 
satisfying r(a*a) > 0, r(l) = 1, and r(ab) — r(ba). There exists a unital 
C*-algebra A with distinct tracial states r//: A — > C [15] . 

Make a category C as follows. Objects are natural numbers. There are only 
endomorphisms. Morphisms — > are complex numbers; the identity is 0, and 
composition is addition. For n > 1, morphisms n — > n are elements of the 
n-fold direct sum A © A © • • • © A; the identity is (1,1,..., 1), and composition 
is pointwise multiplication. 

We give this category a monoidal structure by letting the tensor product of 
objects n and to be n + to. If one of n or to is 0, the action on morphisms is 
by scalar multiplication. For n,m> 1, the action on morphisms is clear. The 
monoidal unit is the object 0. 

Taking T(X) to be all endomorphisms on X certainly gives an endomorphism 
ideal. Define Tro(z) = z, and Tr„(ai, . . . , a„) = X)"=i r ( a «) ^ or n — This 
satisfies all the conditions needed to make 1 into a trace ideal. But the very 
same construction with t' gives a different trace function. 

The previous example is in stark contrast to Cartesian categories or com- 
pact categories, where traces are unique; see [58] and [36], respectively. The 
counterexample above is somewhat artificial, because all morphisms are endo- 
morphisms. It remains unclear whether trace ideals on, for example, compact 
categories, are unique. 

B.3 The need for trace ideals 

We will now show that in the category Hilb, there exists no trace ideal consisting 
of all morphisms. More precisely, we will show that Hilb is not an instance of 
the established notion of traced monoidal category |42j . This notion asks not 
just for traces of all endomorphisms, but requires a 'partial trace' of morphisms 
/: X ®U ^ Y ®U , resulting in a morphism Tr c/ (/) : X — > Y . There are then 
several additional axioms, such as the following naturality: 

Tr C7 (/)o 5 = Tr c/ (/o( 5 (g)l C ;)) for/: X ® U -> Y © U, g : X' X. 
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We will now show that the monoidal category (Hilb, ®) cannot be traced 
monoidal. Subsequently, we will show that it does have a trace ideal. This justi- 
fies working with trace ideals in monoidal categories instead of traced monoidal 
categories. We are indebted to Peter Selinger for the following proof. 

Lemma 17. Suppose (Hilb,®) is traced monoidal. Then Tr(/ + g) = Tr(/) + 
Tr(<?) for all endomorphisms f,g: H — > H. 

Proof. Choose an orthonormal basis {|0), |1)} for C 2 , and write |+) = |0) + |1). 
Recall that C 2 ® H ^ H © H . Define F : C 2 <8> H -)• H via the block matrix 
(/ g). Hence Fo (|0) ® l H ) = f and F o (|1) ® 1#) = g. Now: 

Tr(/ + g ) = Tr(Fo(|+)®l H )) 

= Tt(F) O |+) (by naturality) 

= (Tr(F)o|0)) + (Tr(F)o|l}) 

= Tr(Fo (|0) <g> Iff)) +Tr(Fo (|1) ® l ff ) (by naturality) 

= Tr(/) + Tr( 5 ). 

The third equality uses that composition is bilinear. □ 
Theorem 18. The monoidal category (Hilb,®) is not traced monoidal. 
Proof. Suppose (Hilb, ®) was traced monoidal. Let H be an infinite-dimensional 



Hilbert space. Then there exist isomorphisms ip: H 

C © H. Write them in block matrix form as ip = {tpx ip^) and tp 

Consider the morphisms fx , fi , / 3 : H © C © U — > i? 
following block matrices. 

/i = 



# and V>: # 
iJ given by the 




h = 




h = 




Let 3 = ip © V : (# © C) © 



/f © (Cffiff). Then 





= fi° 9- 

\0 0/ \0 ^a/ 
Hence 

Tr(A) - Tr(/i o fl o .g" 1 ) = Tr( 3 o / 2 o g" 1 ) = Tr(/ 2 o g- 1 o ff ) = Tr(/ 2 ). 

But Tr(/ 2 ) = Tr(/i + / 3 ) = Tr(/ X ) + Tr(/ 3 ) by LemmaEI And because f 3 has 
finite rank, we know that Tr(/ 3 ) = Tr(l c ) = 1. Thus Tr(/ 2 ) = Tr(/ 2 ) + l, which 
is a contradiction. □ 
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B. 4 Dual objects in Hilb are finite-dimensional 

The previous theorem allows an interesting corollary. Recall that the main 
characteristic of compact categories is that objects have duals: objects L, R in 
a monoidal category are called dual when there are maps r\ : I — > R ® L and 
e: L <£) R — > I making the following two composites identities. 

L^L®I L ® (R® L) (L ® R) ® L I®L^L 
R = I®R {R® L)® R = R® (L® f?) R®I = R 

It is well-known that if if G Hilb is finite-dimensional, then if and if* are dual 
objects by 77(1) = K) ® (*l an d — 1; f° r an y choice of orthonormal 

basis {\i)}i=i,...,n for if; see (43j [42l [7] . This recipe does not work when if is 
infinite-dimensional, because J2i K) does not converge in that case. However, 
this does not exclude the possibility that there might be other H*,i],e making 
if into a dual object. No rigorous proof that infinite-dimensional Hilbert spaces 
cannot have duals has been published, as far as we know. 

Corollary 19. Objects in (Hilb, ®) with duals are precisely finite- dimensional 
Hilbert spaces. 

Proof. Let if be an infinite-dimensional Hilbert space. Suppose if has a dual 
object if*. For /: if — >• ff , define Tr H (f) as the following composite. 

I^kH*®H = H®H* m "') H(g>H* A I 

This satisfies all equations for a trace function, as far as these make sense 
'locally', for just one object ff. In Hilb, the object C always has a dual, 
and if if and K have duals, then so does if K. Now, notice that the proof 
of Theorem [TBI only uses the trace properties 'locally', i.e. for the objects 

C, ff, C 2 <g) ff = ff 8 if, if © C, if © C © ff. Hence the contradiction it results 
in holds here, too. □ 

In fact, in any monoidal category with biproducts, one can show that if 
A = A © I , then Tr^U) = Tr A (l A ) + 1. We thank Jamie Vicary for this 
observation. 

B.5 Trace class maps form a trace ideal in Hilb 

To show that the usual trace of continuous linear maps between separable 
Hilbert spaces does in fact give a trace ideal requires some work, as virtually 
all textbooks only consider endomorphisms, whereas the defining conditions of 
trace ideals also involve morphisms between different objects. 

We need to recall some terminology; for any unexplained terms, we refer 
to [17] . Other good references are [31] [57] . A linear map f:H—>K between 
Hilbert spaces is Hilbert- Schmidt when ^ n ll/( e n)|||: < 00 for an orthonormal 
basis (e n ) of if. A positive continuous linear map /: if — > ff is trace class 
when ^2 n I (e„ | /(e n ))| < 00 for an orthonormal basis (e n ) of if. An arbitrary 
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continuous linear map / : H — > H is trace class when its absolute value |/| : H — > 
H is trace class. Both definitions arc independent of the choice of basis (e„). 
If / is trace class, then (e„ | /(e„)) is absolutely summable, and hence the 
following trace property holds: 

Tr(/)=^> n |/(e n )) 

n 

is a well-defined complex number. The Cauchy-Schwarz inequality states that 

|My)|<(IMI 2 -lbll 2 ) 1/2 

for any two elements x, y of a Hilbert space. The Holder inequality states that 

EK-y«i<(EKi 2 ) 1/2 -(Ei^i 2 ) 1/2 

n n n 

for any two sequences (x n ) and (y n ) of complex numbers with J2 n \ x n\ 2 < oo 

and En \Vn\ 2 < OO. 

/ 

Lemma 20. Let H < ' K be morphisms in Hilb. Then go f is trace class if 
and only if f and g are Hilbert- Schmidt. 

Proof. By polar decomposition, there is a unique partial isometry w: H — > K 
satisfying go f = w o\g o f \ and ker(w) = ker(g o /). It follows that \g o f \ = 
w ^ ° 9 ° /• Hence, for an orthonormal basis (e„) of H, 

E K e " ' Iff ° /I (en)) | = E K e " I wf ° 9 ° /(e«)) | 

n n 

= E W ° w ( e «) I /( e «))| 

n 

^ E C 1 1 ° W ( e «)H 2 • ll/(e«)|| 2 ) 1/2 (by Cauchy-Schwarz) 
n 

= EHl5 t °«'(e n )||-||/(e n )||| 

n 

<(Ell5 f ° ^(^)H 2 ) 1/2 • ( E ll/MH 2 ) 172 - Holder) 

Therefore g/ is trace class if and only if J2 n ||/(e„)|| 2 < oo and J2 n llff ° 
w(e„)|| 2 < oo. Because w is a partial isometry, the latter inequality holds if 
and only if ^2 n ||g(e„)|| 2 < oo. That is, g o f is trace class if and only if / and 
g are Hilbert-Schmidt. □ 

Proposition 21. The category Hilb has a dagger trace ideal consisting of the 
usual trace class maps and the usual trace function. 
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Proof. That the trace class maps on a Hilbert space H are closed under adjoint 
and tensor products is easily seen. Also, any morphism C — > C is trivially trace 
class. Now suppose that / : H — > H is trace class. By the previous lemma, we 
can write f = fi° fi for Hilbert-Schmidt maps /j. If g : H — > K and h: K — > i7 
are arbitrary morphisms, then g o f 2 and fi o h are again Hilbert-Schmidt. 
Therefore, by the previous lemma again, g o f oh = (go f 2 ) o (/i o /i) is trace 
class. Thus trace class maps indeed form an endomorphism ideal. 

One easily sees from the trace property that trace is the identity on scalars, 
is multiplicative on tensor products, and preserves daggers. To prove that Tr(go 
/) = Tr(/ o g) for / : H — > K and g: K — > H with both fog and g o f trace 
class, we rely on Lidskii's trace formula: if h is trace class, then (%2 n \ n (h) is 
absolutely convergent and) Tr(/i) = ^ n X n (h), where A„(/i) are the eigenvalues 
counted up to algebraic multiplicity. But go f and fog have precisely the same 
spectrum, so that Tr(g o f) = £) n Ki.9 ° f) = E„ M/ ° 9) = Tr (/ ° 9)- Hence 
trace class operators indeed form a trace ideal. □ 

We have written the above example out in more detail than the reader 
might have thought necessary, because it is easy to overlook subtleties. For 
example, it is not true that if / : H — > K and g: K — > H arc morphisms such 
that g o f is trace class, then / o g is trace class, too. For a counterexample, 
let H = K = £ 2 (N), and define f(x,y) = (0,x) and g(x,y) = (x,Q). Then 
certainly go f = is trace class. But it is easy to see that P(x, y) = (y, 0), that 
g = = g^ o g, and hence that g — g^ o g = (/ o g)^ o (/ o g) > 0. Therefore 
= 9, and 

Tr(/o ff ) = y^(]/og|(e TO ,e„) | (e m ,e n )) = ^(e m | e m ) + (0 | e„) = dim(H) = oo, 

m,n m.n 

so that / o g is not trace class. 
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